Chip Designers to Consider before Using Any of the Branch Prediction Schemes. 6.0 Conclusions
ثبت نشده
چکیده
After we reviewed our simulation results, we realize that the way we set the combined DHT/BHT table size may not have been the optimal. For simplicity, we assumed that the DHT and BHT components had the same number of table entries. We now feel that a the DHT should have more table entries than the BHT. We believe that the prediction rates will remain lower than the DHT-2 alone. The main advantage of the combined DHT-BHT is that in the BHT we can store the target address and thereby reduce the number of wasted cycles on calculating the target address each time. Given enough hardware, all prediction schemes perform roughly the same. Their relative differences are negligible considering standard deviations of approximately 2.5% from the mean of all benchmarks. The notable exceptions are TLA-6 and TLA-12 whose standard deviation are about 6%. Our most surprising result is that DHT-2 has roughly 90% prediction accuracies for buffer sizes in the 32-64 byte regions. Due to the high accuracy of the small DHT-2, we believe researchers should focus on small and simple branch prediction schemes. However, as is evident in the literature, researchers have been recently looking at increasingly complex branch prediction techniques. Our results demonstrate that these schemes do not improve prediction significantly and perform much worse for low hardware-cost implementations. Correlation based schemes overcome one level techniques only after they reach their saturation point. Because of the low hardware-cost implementations possible with DHT, processor designers should investigate into designing prediction units per process rather than per processor. We have not studied the effect of interleaving instruction streams through a branch prediction simulation and further study in this area is warranted. If target addresses are needed to reduce branch cycle penalties, we recommend combining a DHT-2 with a cache. The cache would store only taken branches using the branch instruction address as a tag and storing the associated target address. This is similar to the combined DHT/BHT proposed in [1] except that it is less complex. Again, in this case, multiple small DHT tables could be introduced to allow for multiple processes.
منابع مشابه
Procrastination: The role of prediction cognitive errors and perfectionism
The present study aims to predict procrastination through cognitive errors and perfectionism. The present research method is a descriptive correlation. The statistical population included all high school students in the east of Tehran in the academic year 1393-94, among whom 280 were selected by Simple Random Sampling. To collect data, the negative cognitive errors questionnaire (Leitenberg et ...
متن کاملCongestion estimation of router input ports in Network-on-Chip for efficient virtual allocation
Effective and congestion-aware routing is vital to the performance of network-on-chip. The efficient routing algorithm undoubtedly relies on the considered selection strategy. If the routing function returns a number of more than one permissible output ports, a selection function is exploited to choose the best output port to reduce packets latency. In this paper, we introduce a new selection s...
متن کاملAn Optimization on the DIN1.2080 Alloy in the Electrical Discharge Machining Process Using ANN and GA
Electrical Discharge Machining (EDM) process is one of the most widely used methods for machining. This method is used to form parts that conduct electricity. This method of machining has used for hard materials and therefore selects the correct values of parameters which are so effective on the quality machining of parts. Reaching to optimum condition of the DIN1.2080 alloy (D3) machining is v...
متن کاملAn Optimization on the DIN1.2080 Alloy in the Electrical Discharge Machining Process Using ANN and GA
Electrical Discharge Machining (EDM) process is one of the most widely used methods for machining. This method is used to form parts that conduct electricity. This method of machining has used for hard materials and therefore selects the correct values of parameters which are so effective on the quality machining of parts. Reaching to optimum condition of the DIN1.2080 alloy (D3) machining is v...
متن کاملOptimization of Material Removal Rate in Electrical Discharge Machining Alloy on DIN1.2080 with the Neural Network and Genetic Algorithm
Electrical discharge machining process is one of the most Applicable methods in Non-traditional machining for Machining chip in Conduct electricity Piece that reaching to the Pieces that have good quality and high rate of machining chip is very important. Due to the rapid and widespread use of alloy DIN1.2080 in different industry such as Molding, lathe tools, reamer, broaching, cutting guillot...
متن کامل